Improve and re-release chapter 2 #911
base: main
Conversation
}
]}
/>
# Optimized Inference Deployment
There is an issue in the build process. I think it comes from the framework tags in this file; at the very least, <FrameworkSwitchCourse {fw} /> seems to be missing.
The model can be used in this state, but it will output gibberish; it needs to be trained first. We could train the model from scratch on the task at hand, but as you saw in [Chapter 1](/course/chapter1), this would require a long time and a lot of data, and it would have a non-negligible environmental impact. To avoid unnecessary and duplicated effort, it's imperative to be able to share and reuse models that have already been trained.
You'll notice that the tokenizer has added special tokens — `[CLS]` and `[SEP]` — required by the model. Not all models need special tokens; they're utilized when a model was pretrained with them, in which case the tokenizer needs to add them as that model expects these tokens.
This sentence feels convoluted
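For illustration, a minimal sketch of the behaviour that paragraph describes, using the standard `transformers` tokenizer API (the checkpoint and example sentence are arbitrary, and the exact token split shown is illustrative):

```python
from transformers import AutoTokenizer

# Load a tokenizer whose model was pretrained with [CLS]/[SEP] special tokens
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

encoded = tokenizer("Using a Transformer network is simple")
print(tokenizer.convert_ids_to_tokens(encoded["input_ids"]))
# Illustrative output:
# ['[CLS]', 'Using', 'a', 'Trans', '##former', 'network', 'is', 'simple', '[SEP]']
```

The point is simply that `[CLS]` and `[SEP]` are injected by the tokenizer because this particular checkpoint was pretrained with them.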
<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/tgi/flash-attn.png" alt="Flash Attention" />

<Tip title="How Flash Attention Works">
Flash Attention is a technique that optimizes the attention mechanism in transformer models by addressing memory bandwidth bottlenecks. As discussed earlier in [section 12.3](2.mdx), the attention mechanism has quadratic complexity and memory usage, making it inefficient for long sequences.
Suggested change:
Flash Attention is a technique that optimizes the attention mechanism in transformer models by addressing memory bandwidth bottlenecks. As discussed earlier in [section 12.3](2.mdx), the attention mechanism has quadratic complexity and memory usage, making it inefficient for long sequences.
Flash Attention is a technique that optimizes the attention mechanism in transformer models by addressing memory bandwidth bottlenecks. As discussed earlier in [Chapter 1.8](/course/chapter1/8), the attention mechanism has quadratic complexity and memory usage, making it inefficient for long sequences.
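To make the quadratic-memory point concrete, a hedged sketch assuming PyTorch 2.x, where `scaled_dot_product_attention` can dispatch to a fused, Flash-Attention-style kernel on supported hardware instead of materializing the full score matrix:

```python
import torch
import torch.nn.functional as F

batch, heads, seq_len, head_dim = 1, 8, 1024, 64
q = torch.randn(batch, heads, seq_len, head_dim)
k = torch.randn(batch, heads, seq_len, head_dim)
v = torch.randn(batch, heads, seq_len, head_dim)

# Naive attention materializes a seq_len x seq_len score matrix per head,
# so memory grows quadratically with sequence length.
scores = (q @ k.transpose(-2, -1)) / (head_dim ** 0.5)
out_naive = torch.softmax(scores, dim=-1) @ v

# The fused implementation computes the same result in tiles without storing
# the full score matrix; a Flash-Attention kernel is one backend it may pick.
out_fused = F.scaled_dot_product_attention(q, k, v)

print(torch.allclose(out_naive, out_fused, atol=1e-4))
```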
**vLLM** takes a different approach by using PagedAttention. Just like how a computer manages its memory in pages, vLLM splits the model's memory into smaller blocks. This clever system means it can handle different-sized requests more flexibly and doesn't waste memory space. It's particularly good at sharing memory between different requests and reduces memory fragmentation, which makes the whole system more efficient.

<Tip title="How Paged Attention Works">
Maintaining consistency in naming:
Suggested change:
<Tip title="How Paged Attention Works">
<Tip title="How PagedAttention Works">
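A toy sketch of the paging idea (an assumption-laden illustration, not vLLM's actual API): the KV cache is carved into fixed-size blocks and each request keeps a small block table, so memory grows in block-sized increments and freed blocks return to a shared pool.

```python
# Toy illustration of paged KV-cache management (not vLLM's real implementation).
BLOCK_SIZE = 16  # tokens per block, analogous to a page size

class PagedKVCache:
    def __init__(self, num_blocks: int):
        self.free_blocks = list(range(num_blocks))   # pool shared by all requests
        self.block_tables = {}                       # request_id -> list of block ids

    def append_token(self, request_id: str, position: int) -> int:
        """Return the block that should hold the KV entry for this token."""
        table = self.block_tables.setdefault(request_id, [])
        if position % BLOCK_SIZE == 0:               # current block full (or first token)
            table.append(self.free_blocks.pop())     # allocate one small block, not a big slab
        return table[position // BLOCK_SIZE]

    def release(self, request_id: str):
        """When a request finishes, its blocks go straight back to the shared pool."""
        self.free_blocks.extend(self.block_tables.pop(request_id, []))

cache = PagedKVCache(num_blocks=64)
for pos in range(40):                                # a 40-token request only needs 3 blocks
    cache.append_token("req-1", pos)
print(len(cache.block_tables["req-1"]))              # 3
cache.release("req-1")
```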
Super well written and informative, as always @burtenshaw. Great improvement over the previous iteration! The main issue is the failing build process; the rest are just nits 😄
Thanks @sergiopaniego. Working on the framework options now.
Co-authored-by: Sergio Paniego Blanco <[email protected]>
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
This is a minor improvement to chapter 2 to do these things: